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This study aims to hide patient’s privacy details of digital imaging and 
communications in medicine (DICOM) files using the quick response (QR) 
code images with the same size using steganographic technique. The 
proposed method is based on the properties of the discrete cosine transform 
(DCT) of the DICOM images to embed a QR code image. The proposed 
method includes two parts: data embedding and extraction process. 
Moreover, the stego DICOM image could be blindly used to produce the 
embedded QR code image without the existence of the original DICOM 
image. The performances of proposed method were evaluated using the 
metrics of the peak signal to noise ratio (PSNR), the structural similarity 
index (SSIM), the universal quality index (UQI), the correlation coefficient 
(R) and the bit error rate (BER) values. The experimental results scored a 
high PSNR after the embedding process by embedding a QR code image 


QR code into the DICOM image with the same size. 
This is an open access article under the CC BY-SA license. 
(OKO 
Corresponding Author: 
Ahmed Elhadad 
Department of Computer Science, Faculty of Computer and Information Sciences, South Valley University 
Qena, Egypt 


Email: ahmed.elhadad @sci.svu.edu.eg 


1. INTRODUCTION 

Digital imaging and communications in medicine (DICOM) is the standard medical imaging data 
format, which is most used for the exchanging, management and viewing of medical images, such as 
computerized tomography (CT) scans, magnetic resonance imaging tests (MRIs), and ultrasound images [1]. 
The DICOM format was developed and copyrighted by the National Electrical Manufacturers Association 
(NEMA) [2], [3]. It has been broadly espoused by hospitals and is making inroads into smaller applications 
like labs and dentists' and doctors' offices. The DICOM frequently parsed and enables the integration 
between different devices and platforms which using the picture archiving and communication system 
(PACS) [4]-[8]. 

The medical information of patients must be protected from unauthorized access. Generally, the 
medical information security refers to the rules of security policies that strict ethics rights and privacy of the 
patient and must be concerned by entities. There are many widely used standards and tools for protecting 
personal medical information such as the ISO27799 (security management in health using ISO/TEC/27799) 
[9], cryptography and steganography techniques [10]-[13]. The ISO27799 is a standard which provides the 
security management guidelines for health organizations to protect medical information. Cryptography is a 
data protection using encryption provider which converts the medical information into unintelligible text 
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using secure key. Steganography can be defined as the use of a host (container) data to hide or embed a piece 
of information that is hidden directly in media content, in such a way that it is imperceptible to a human 
observer but can be detected/extracted easily with a computer [14], [15]. 

The quick response code is always abbreviated to QR code which is a barcode that is readable by an 
imaging device such as a camera and smartphones. The QR code system was originally invented and 
designed in 1994 by the Japanese Company Denso Wave and it was registered as a trademark of the same 
company [16], [17]. Simply, the QR code is a matrix code of two-dimensional barcodes and consists of black 
squares arranged in a square grid on a white background. Unlike the one-dimensional barcodes that were 
designed to be scanned by a narrow beam of light, the QR code is scanned by a digital image sensor and then 
digitally analyzed by a programmed processor. The QR code includes three main distinct squares at the 
corners to set up the image size normalization, orientation, and angle of viewing. Moreover, the small dots 
throughout the QR code are then converted to binary numbers and validated with the Reed-Solomon 
error-correcting algorithm [18] which are encoded as bytes of 8 bits. In practice, QR codes often contain data 
for a locator, identifier, or tracker that points to a standard URL for a website or application. A QR code uses 
four standardized encoding modes (numeric-Max. 7,089 characters, alphanumeric- Max. 4,296 characters, 
byte/binary-Max. 2,953 characters, and kanji-Max. 1,817 characters) to store the amount of data efficiently; 
extensions may also be used [18]. 

The rest of this paper is organized as follows: section 2 presents the used materials and methods. 
The proposed method is illustrated in detail in section 3. Section 4 shows the experimental results and 
discussions. Finally, the study and the main contributions are concluded in section 5. 


2. MATERIALS AND METHODS 
2.1. The cancer imaging archive (TCIA) COVID-19 dataset 

This study aims to hide patient’s privacy details of DICOM files using the QR code images 
therefore, TCIA COVID-19 dataset is used [19]—[22]. TCIA COVID-19 dataset is a chest radiology imaging 
with CT imaging studies, a set of clinical data and key radiology findings for patients representing a rural 
COVID-19 positive population in the state of Arkansas. These data are cross-linked to severe acute 
respiratory syndrome coronavirus 2 (SARS-CoV-2) complimentary DNA (cDNA) sequence data extracted 
from clinical isolates from the same population, uploaded to the GenBank repository. This collection dataset 
includes 105 patient’s data with 31,935 total number of images, 256 number of studies, 461 number of series 
and 19.0 GB images size. The TCIA COVID-19 dataset provides three study modality computed radiography 
(CR): (26 images (0.1%), 19 patients), direct digital capture (DX): (236 images (0.7%), 100 patients) and 
computed tomography (CT): (31673 images (99.2%), 23 patients). In addition, the dataset provides different 
images size between 350x552 to 3366x3006 using DICOM standard format. Figure 1 shows image samples 
description of the TCIA COVID-19 dataset. 


EJ 


Patient ID: COVID-19-AR-16406491 Patient ID: COVID-19-AR-16406498 
Age: 49 Age: 59 

Sex: F Sex: F 

Study Modality: DX Study Modality: DX 


Patient ID: COVID-19-AR-16406488 Patient ID: COVID-19-AR-16406502 


Age: 67 Age: 88 
ex: Sex: F 
Study Modality: DX Study Modality: DX 


Figure 1. DICOM images samples of TCIA COVID-19 dataset 
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2.2. The QR code 

In this paper, the QR Code generator is based on the zebra crossing (ZXing) library [23]—[25]. The 
QR code generator is a software that creates data into a QR code image using the format information of two 
things: the error correction level and the mask pattern used for the symbol. The mask patterns are specified 
on a grid that is repeated as necessary to cover the whole symbol and protected from errors with a Bose- 
Chaudhuri-Hocquenghem (BCH) code and a double of complete copies are included in each QR pattern. 
Hence, ZXing is an open-source library project implemented in Java, with ports to other languages which 
supports generating and decoding of multi-format 1D/2D barcode image processing such as QR code and 
data matrix within images and all files can be imported on the fly from a maven repository or can be 
downloaded via a command. Figure 2 shows the generated four QR codes of various size which are used as 
an embedded image within the DICOM files. 


Patient_ID:COVID-19-AR-16406494 a. fue LY, Patient_ID:COVID-19-AR-16406513 


Study Modality: CR ‘é = OLE wel ee Study Modality: CR, DX 
image Study Description: XR CHEST ‘ 
AP PORTABLE 

Radiologist KEY FINDINGS: 
Worsening bilateral airspace 
disease is seen within the lower 
lobes 


Patient_ID:COVID-19-AR-16406561 
Study Modality: DX 

Image Study Description: XR CHEST 
AP PORTABLE 

Radiologist KEY FINDINGS: Hazy 
airspace opacities are seen 


Image Study Description: XR CHEST 
PA AND LATERAL 

Radiologist KEY FINDINGS: Patchy 
increased opacity in the lower 
lobes bilaterally, more pronounced 
on the lateral view. Small pleural 


Patient_ID:COVID-19-AR-16406559 
Study Modality: DX 

Image Study Description: XR CHEST 
AP PORTABLE 

Radiologist KEY FINDINGS: ill- 
defined densities at the lung bases 


throughout the bilateral lower nf ote, ¢ J ` left greater than the right 
lungs, right greater than left. "i ¢ 


Figure 2. QR code examples of various size and corresponding decoded text 


3. THE PROPOSED METHOD 

Figure 3 shows a high-level overall view of the main parts of the proposed method to embed patient 
information into the DICOM image. The main idea of this method is to generate a new image (stego) whose 
pixels are combination of two images with the same size where one of them is used as cover and the other is 
the secret message (Msg). The generated stego image should be similar to the cove image with little 
distortion as possible. Then this stego image could be blindly used to produce the embedded massage without 
the existence of the original image. The proposed method includes two parts: data embedding and extraction 
process. 


3.1. Data embedding process 

In this process, the data embedding process mainly focuses on embedding the QR code image into 
the DICOM image. The proposed method collects patient information from the DICOM file and the 
corresponding clinical data such as patient ID, image study description, study modality and radiologist key 
findings. Next, the proposed method generates a QR code image using the collected patient data then it 
resizes the QR code image to be compatible with the cover DICOM image. As a preprocessing stage, both 
the QR code image and the cover DICOM image are normalized and adjusted to get float pixel values whose 
pixels have float values that belong to the interval [0, 1] instead of the integer range and to avoid an overflow 
in the embedded stego coefficients of saturated pixel values (0 or 1) using a small value (a) respectively. 


Normalization(Img) = 90m) 


(1) 


max(Img)—Min(Img) 
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= > 
Adjustment(Img) = Le j A 


(2) 
After normalization and adjustment are done, in preprocessing stage, the resultant cover and QR code images 
are divided into small parts of 4x4 matrices called micro-blocks (MB). Then the embedding process stage is 
ready to start. 

In the embedding process, the main aim of this step is to embed the normalized MB of the QR code 
image into the corresponding normalized MB of the cover DICOM image to construct integrated stego 
DICOM image. Thus, 2-D discrete cosine transform (DCT) is applied on the cover’s MBs. 


DCT coef ficients =T X MBeoyer X T' (3) 
1 1 1 1 
v4 v4 v4 v4 


E Cos&) E Cos) E Cos) E Cos) 
i Cos È E Cos(=) 1 Cos(=) E Cos() 
E Cos) E Cos) E Cos (5) E Cos) 


The resultant matrix consists of 4x4 coefficients; one approximation and fifteen details coefficients. Next, the 
embedding process is carried out on the cover DCT coefficients by replacing these coefficients with the 
corresponding normalized QR code image’s MB pixel values using (4). 


Stego coef ficient = sign(DCT coef ficient coyer) [2(MBor code +i), 


4 s|DCT coef ficientcover|=| (4) 


Vi=0,1,2,...,B—1 


Collect Patient Generate QR 


Image Resize 
Information y Code 


DICOM Image e 


4 


DCT A Micro Blocks MB Adjustment n Normalization 
N 4x4 N — N 
l 
Embeddi Inv DCT De- 
m ing ay nv ay Normalization E Stego DICOM 
——————E ——————— es” 


ad 


DCT Micro Blocks MB @& N lizati 4 
g 4x4 prthldeselths ht . Stego DICOM 


Extraction De- QR Code 
Normalization 


Figure 3. A high-level overall view of the proposed method to embed patient information into DICOM image 
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The absolute values of the DCT coefficients lies in the interval [0, 2] where the range is actually 
determined by the minimum and maximum for each coefficient value Thus, P is an additional parameter that 
will be used to divide the range [0, 2] of the absolute value of the cover DCT coefficients. To improve stego 
image quality against noise, the approximation coefficient value of the cover image is kept without 
modification during the embedding process for each MB. After the embedding process is done, the stego 
DICOM image is obtained by applying the inverse DCT (Inv. DCT) then denormalization is applied to 
convert the pixel values back to their original domain using (5). 


MBstego = T' X DCTstego coef ficients x T (5) 
Finally, the resultant stego image is saved as a medical image in DICOM format. 
3.2. Data extraction process 

The steps of extraction process are exactly the inverse of those followed during the embedding 
phases. Therefore, these process starts by read the stego DICOM image then apply the normalization to 
convert the pixel values to [0, 1] domain. Next, the normalized stego DICOM image is divided into 4x4 MB 
and compute the DCT decomposition using (6). 


DCT coef ficients =T X MBstego XT’ (6) 


After that, the extraction stage is applied on the stego DCT coefficients: 


MBprec-QR code = Ê (pcr coef ficientstego — 2), Zeper coef ficientstego| E” (7) 


vi=0,1,2,...,B—1 


where ß is the same integer number is used in the embedding stage. In addition, to correctly reconstruct the 
first element in MBrec-or code, the mean value of three neighbors is calculated and replaced on it. Finally, the 
resultant Rec-QR code is denormalized to set the pixel values back to their original domain which is similar to 
the original QR code image. 


4. RESULTS AND DISCUSSION 

This section discusses and analysis the results of the proposed method using the TCIA COVID-19 
dataset, which was obtained from [19]-[22] that includes 105 patient’s data with 31,935 total number of 
images, 256 number of studies, 461 number of series and 19.0 GB images size. The simulation was 
implemented using MATLAB version: 9.3.0.713579 (R2017b) with the image processing toolbox and 
includes read DICOM image function for image importing and conversion. 

Figure 4 shows the DICOM image, QR code image, stego DICOM and reconstructed QR code 
image samples from two patients’ TCIA COVID-19 images of the proposed method. Figures 4(a) to (d) 
illustrate the cover DICOM and the QR code images respectively however Figures 4(e) and 4(f) show the 
resultant stego DICOM image after embedding the QR code image into the cover DICOM image. Moreover, 
Figures 4(g) and 4(h) show the reconstructed QR code image after applying the extraction process from stego 
DICOM images by an authorized person. The simple visual inspection of the results shows a very high 
quality of the stego DICOM images while providing a scannable visual quality for the retrieved QR code 
image. 

The performances of proposed method were evaluated using the peak signal to noise ratio (PSNR), 
the structural similarity index (SSIM), the universal quality index (UQDI), and the correlation coefficient (R) 
values, whose details were explained: 


2 
PSNR = 10 log; (2) 


MSE 


(8) 
1 M-1N-1 3 
MN (F(xiy;) - a(xoy;)) 


O 


MSE = 


i= 


< 


where f(x, y) and g(x, y) defined the cover and stego patients’? images. M and N are the height and width of 
the images. The peakval is the range of the image data type (e.g., for uintl6 image it is 65535). Figure 5 
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shows the resultant mean of PSNR values to compare between cover and stego DICOM images in decibel 
(dB). The mean PSNR values ranged between 63.47 dB and 81.97 dB for various values of B. The minimum 
and maximum value of the mean PSNR are 50.41 dB and 95.33 dB respectively. Generally, PSNR higher 
values refers that the invisibility of higher quality. 


Stego DICOM 


DICOM image _ QR code image 


Patient ID: COVID-19-AR- Beta = 100 


16406494 are ENGE PSNR = 75.34698718 Size: 1760*1760 
Study Modality: CR HOA KE ʻi ese SSIM = 0.999965812 
Image Study Description: : E 3 PaRa UQI = 0.991518428 
XR CHEST AP PORTABLE airspace disease is seen R = 0.996821832 
Size: 1760*1760 within the lower lobes BER ratio = 0.259107344 
(a) (b) (c) (d) 


Patient ID: COVID-19-AR- Size =69*69 Beta = 1000 Size: 2846*2330 
16406561 Radiologist KEY FINDINGS: PSNR = 66.38702116 
Study Modality: DX Hazy airspace opacities are SSIM = 0.999794646 
Image Study Description: seen throughout the UQI = 0.999999968 
XR CHEST AP PORTABLE bilateral lower lungs, right R = 0.999987386 
Size: 2846*2330 greater than left. BER ratio = 0.245096991 
(e) (f) (g) (h) 


Figure 4. DICOM images samples of COVID-19-AR- 16406494 and COVID-19-AR- 16406561 and the 
results DICOM images of the proposed data hiding method: (a) to (d) illustrate the cover DICOM and the QR 
code images respectively, (e) and (f) the resultant stego DICOM image after embedding the QR code image 
into the cover DICOM image, (g) and (h) the reconstructed QR code image 


The PSNR is approach to human observation of invisibility quality with the human visual system 
(HVS). On the other hand, the structural similarity (SSIM) index is a perceptual metric that quantifies image 
quality degradation as perceived change in structural information [26], [27]. In addition, the universal image 
quality index (UQI) proposed by Wang and Bovik evaluates the loss of correlation, luminance distortion, and 
contrast distortion [28], [29]. Therefore, the performances of SSIM and UQI are better than the MSE and 
PSNR. Equations (9) and (10) illustrate the SSIM and the UQI in details: 


(2uxuy+c1)(20xy+c2) 
(ug+u},+c1) (of+0}+c2) 


SSIM (x,y) = (9) 


UQI(x,y) = _ Gaby 20 xy) (10) 


(ug+uZ) (0% +03) 


where ux and uy are the local means, 67x and oy are the variance of x and y. oxy is the cross-covariance for 
images x, y. cı and c2 are variables to stabilize the division with weak denominator. The correlation 
coefficient (R) is widely used in statistical analysis to compare two images in image processing purposes 
[30]-[33]. Equation (11) describes R in detail: 
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= Xx XyCxy-f)-(oxy- 9) (11) 
V(x Dy(fxy- PAxx- g)> 


where f and g are the mean of cover and stego patients’ images pixels. Figure 6 shows the mean of the 
obtained SSIM to compare between the cover and resultant stego images. The minimum and maximum mean 
values of the SSIM confined between 0.979 and 1 while the overall mean SSIM values between 0.9964 
and 1. Figure 7 and Figure 8 illustrate the obtained results of the mean UQI and the mean R respectively. The 
minimum and maximum values of the mean UQI are ranged between 0.48 and 1 while the minimum and 
maximum values of the mean R are recorded between 0.93 and 1. From the obtained PSNR, SSIM, UQI and 
R results, the value of the parameter B always effects on the resultant quality of the stego patients’ image 
where the high B values produce high quality stego DICOM image. 


ZAA Minimum E Minimum 
Mean 1.005 E Mean 
ER Maximum Maximum 
1004 ——— 
1:000 Ho EH EE e ye = a E 
4 1 = Š = = = = = Sri 
sel p i 0.9954 BE mE Neti E Sej Enf E 
Emi = = = = — = 
m : c 0.990 ig E sti] Nae NE St E 
T =. 2 Sep Sup ES SH) Sey SH Sal = 
4 j = =} = = = = = z = 
> E s | td | 
Z $ |] = = : Stil WS NS =| 
D Smi sui = - 
2 40 Eii HE o) 0.980 EH NE ZH NE | = 
Ly i EEEa 
] : : 0.975 a SH Nal NSH Rau NE 
20 =a: j a - = iii = = = z z i = 4 - = = 
= a = į 0.970 H A St] smg Nat Sug Sug Smi S 
E = i = n- =H 1 = = = 
o a : = 0.965 -AP A Be SE ` 7 


100 200 300 400 500 600 700 800 900 1000 100 200 300 400 500 600 700 800 900 1000 


B B 


Figure 5. The mean of resultant PSNR for stego and Figure 6. The mean of resultant SSIM for stego and 


cover patients’ DICOM images cover patients’ DICOM images 
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0.8 : : mi il H ia A E = Lif 
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Figure 7. The mean of resultant UQI for stego and Figure 8. The mean of resultant R for stego and cover 
cover patients’ DICOM images patients’ DICOM images 


Finally, the bit error rate (BER), returned as a scalar which is the number of bit errors to the total 
number of bits used in the binary representation. Figure 9 shows the mean BER between the cover and stego 
patients’ DICOM images for various values of B. The mean BER values ranged between 0.1632 and 0.2820. 
The minimum and maximum values of the mean BER are 0.03 and 0.34 respectively. 
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Figure 9. The mean of resultant BER for stego and cover patients’ DICOM images 


5. CONCLUSION 

Most DICOM objects contain images and associated demographic and medical information about 
the patient, which need to be kept confidential. Cryptography and steganography are generally used to ensure 
medical information of patients’ security. Therefore, this study aims to hide patient’s privacy details of 
DICOM files using the QR code images. Different sized 31,935 images of 105 patients were used as cover 
and embedded QR code images. The main idea of this method is to generate a new image (stego) whose 
pixels are combination of two images with the same size where one of them is used as cover and the other is 
the secret message (Msg). The generated stego image should be similar to the cove image with little 
distortion as possible. Then this stego image could be blindly used to produce the embedded massage without 
the existence of the original image. The performances of proposed method were evaluated using PSNR, 
SSIM, UQI, R and BER metrics. The mean PSNR values ranged between 63.47 dB and 81.97 dB. 
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